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POWER REDUCTION CIRCUIT AND METHOD 
WITH MULTI CLOCK BRANCH CONTROL 

Field Of The Invention 

The invention relates generally to circuits and methods for reducing power 
) consumption for electronic circuits, and more particularly to circuits and methods that 
reduce power consumption for devices employing a memog^ntrgllgr^ 

Background Of The Invention 

5 Portable electronic devices such as notebook computers, personal organizers, 

portable telecommunication equipment and other electronic devices consume much 
power during their display mode. By way of example, graphics control chips for laptop 
computers may be integrated circuits having synchronous dynamic ram (SDRAM) on the 
same die as the memory controller, other video and graphics processors, and central 
to processing units if desired. For example, a conventional type of graphics control circuit 
may include a number of memory access request circuits (or access request engines) such 
as a video capture engine, a two dimensional and three dimensional drawing engine, a 
display engine, a video playback engine, a host processor, onboard SDRAM, SGRAM or 
other RAM serving as the frame buffer memory, a memory controller and a phase lock 
25 loop circuit (PLL) for generating a memory clock. As known in the art, each engine may 
have anotiier clock, other than the memory clock, such as from another PLL or external 
clock, creating a clock boundary. Graphics control chips typically also include another 
phase lock loop circuit for generating a clock for a display device (or devices) such as a 
CRT that may plug into the laptop computer or an LCD display that is mounted to the 
30 laptop computer. A central processing unit (CPU) of the computer interfaces with the 
graphics chip and other peripheral devices as known in the art. A laptop computer or a 
portable device may include a TV tuner or video decoder, as part of a multimedia 
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package, that sends video infonnation to the video capture engine for eventual display on 
the LCD display after being stored in the memory. 

With chips such as graphics controller chips, the many graphic engines 102 
attempt to access the memory to perform their necessary operations. However only one 
of the graphic engines can typically access the memory at a given time. In addition, 
multiple memory controllers may be configured to access different portions of memory at 
the same time. Some of the display operations require real-time processing, such as 
video capture operation, display operation and video playback, so that real-time display 
can occur on the LCD display, or any suitable display device. For example, where the 
TV tuner is applying video to the video capture engine, the video should be processed m 
xeal-time to facilitate display in real-time which is necessary, for example, for live 
performances or when the TV tuner is providing live feed. Hence this engine has a 
higher priority over, for example, a 2D or 3D drawing engine which may be sUghtly 
delayed and still provide the user with high performance on display times. 




. ^ problem arises with such devices since power consumption and thermal 
"^'lilsipatiXneeds to be minimized for portable devices without unnecessarily sacrificing 
operationaXerfo^^^^- Also, the same problem arises for non-portable devices due to 
0 increased cilit density and increased clock speeds. The power dissipation of a graphics 
control chip aiV other integrated circuits are typically related to the operational actmty 
of memory. CoVentional portable display systems typically have power management 
systems that gene\te system level standby/suspend commands. During system level 
standby/suspend mk^. graphics controller subsystems and other subsystems may 
,5 typically respond byWcing the frame buffer memory into a low power self refi-esh mode 
for the duration of theWstem level standby/suspend mode. This may be done for 
example by pulling a m\nory clock enable line low and other suitable pins to put the 
memory in a self refresh Vde. The synchronous memories are designed to switch mto 
energy savings modes basekn the level of the memory clock enable signal. In addition, 
30 the memory clocks, engine clV register clocks and other clocks are also disabled 
during suspend mode to save enei 
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FIG. 1 shows, by way of example, a block diagram of a portable display system 
used in devices such as a laptop computer, a handheld processing device, 
telecommunication device or any other suitable portable display device, that generates 
5 graphics and/or video display infonnation to a display device and employs system level 
standby/suspend power management control. When the display system is a laptop 
computer, such systems typically include an operating system 10 that operates under 
control of a central processing unit, for example, and a power management control 
system 12 which then generates a suspend/standby command 14 to a memory controller 
10 16. The memory controller 16 then generates a clock enable/disable signal 18 to, for 
example, control a memory clock enable pin (and/or other pins) on a graphics memory 
device 20 to put the synchronous memory in a self refresh mode during the system level 
standby/suspend mode (e.g., the inactive mode). The graphics memory device 20 may 
be, for example, an SDRAM, SGRAM or any other suitable graphics and/or video 
1 5 memory device. The operating system 1 0 generates a suspend/standby command 22 
when, for example, the laptop computer is in a standby/suspend mode as activated 
through a graphic user interface, software controlled timer, switch or other trigger event. 
Accordingly, such systems can reduce the power of the graphic subsystem which 
includes the synchronous memories used for frame buffer operations. A clock enable 
20 signal 18 is typically the memory clock enable 10 on the memory device 20. This control 
typically only puts the SGRAM or memory device in a low power mode when the 
^phic system is completely idle and tums off the memory clock and other clocks during 
the suspend mode. When the operating system 10 indicates that the display is in the 
active mode (e.g., the display is enabled), the memories are typically always enabled and 
25 the memory clock is always running. 

However, such systems do not typically provide suitable power reduction during 
the active operational mode of the system since the memory clock is always running 
regardless of which circuits may be idle during the active mode. For example, various 
30 memory request engines 24a - 24n. although in the active mode, may not be generating 
memory requests to memory controller 16. 
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In addition, many circuits employing memory controllers such as graphics 
accelerator circuits and video processing circuits may also use memory read data latching 
flops which latch data read from memory. Such latching circuits may include, for 

5 example, a plurality of flops designed to latch read data from memory based on a 
memory access request. Also typically, the latches are controlled by a memory clock 
circuit and if desired, an additional clock circuit to compensate for delays including 
circuit board line delays so that read data is not inadvertently skewed to the wrong clock 
cycle when read out from memory. These flops are typically always active. For 

10 example, the clock to the groups of flops are constantly running. During non-read cycles, 
undesired data is fed through the read path of the chip consuming power and potentially 
adding additional switching noise. 

Graphics processing circuits are also known to stop the engine clock and/or 
1 5 register clocks during suspend and standby modes. This typically prevents access to 
registers for the graphics processing circuits. Other graphics processing circuits are 
known which have activity-based power reduction by turning off, for example, a graphic 
user interface clock to a GUI generating engine and a three dimensional rendering engine 
clock to 3D engine when the 3D engine is not being used. This is typically done because 
20 the 3D engine, when operational, can consume large amounts of processing capability of 
the graphics processing circuitry. As such, branches from a common engine clock are 
selectively turned off based on an activity level. However, it is typically done for only 
the larger processing engines. In addition, where the engines utilize more than one clock, 
only one clock, such as an engine clock, is controlled. The memory clock in such 
25 engines is typically allowed to continue to operate during the active mode. 

Consequently, there exists a need for a dynamic power reduction circuit that can 
reduce power consumption and power dissipation without unnecessarily degrading 
system performance during active system modes. It would be advantageous, if such a 
30 system could detect memory access demand and automatically adjust memory operation 
accordingly to facilitate power reduction. 
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Brief Description Of The Drawings 

The invention described herein will be more readily understood in view of the 
5 following drawings where: 

Figure 1 is a block diagram generally illustrating a conventional graphics 
accelerator circuit; 

Figure 2 is a block diagram illustrating one example of a power consumption 
1 0 reduction circuit in accordance with one embodiment of the invention; 

Figure 3 is a block diagram illustrating an example of a power consumption 
reduction circuit in accordance with one embodiment of the invention. 

Detailed Description Of a Preferred Embodiment of The Invention 



Briefly, a power consumption reduction circuit and method utilizes a memory 
clock source and a memory clock divider circuit that generates divided memory clock 
output signals as a plurality of corresponding independent clock signals to a number of 
memory interface circuits for different processing engines. The memory clock provides 
20 clock signals to circuits such as a memory controller and memory interface circuit for 
memory request engines. The memory interface circuits are used for servicing and 
managing memory requests. A memory clock divider circuit and method selectively 
activates a plurality of independent clock signals in response to received condition data. 
Condition data may be activity based or user defined. Examples of condition data may 
25 include, for example, whether a primary or secondary display has been selected, whether 
a graphic user interface engine is active, whether a video overlay scaler has been enabled, 
whether subpicture operation has been enabled, and whether video capture operations 
have been enabled. If desired, the speed of the memory clock may also be sped up or 
slowed down depending upon the type of memory requests seeking data from a memory 
30 (such as a frame buffer). 
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<:L?y^ "inone embodiment, an engine clock source is also coupled through a switching 
^ circuit such^lmt it is selectively output to one or more processing engines, such as video 
overlay engineO^o capture engines or any other suitable data processing engine, 
storage circuits or oth^s^itable circuits. As such, the engine clock provides clock 
5 signals to memory requests^ndnes and other general purposes processing engines for 
example, programmable read wrifes;egisters. The switching circuit disables the output 
from the engine clock based on regist^^ndition data. Register condition data may 
include, for example, whether a multimedia^rt has been enabled, whether video capture 
processor has been enabled, whether standby mb^e has been enabled, whether a graphic 
10 user interface is active, or any other suitable conditi^i. 
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In addition, the power consumption reduction circuit and method may also 
include a plurality of memory read latch circuits and a memory read latch control circuit. 
The memory read latch control circuit is operative to dynamically activate and deactivate 
15 the plurality of memory read latches based on detected memory read requests to facilitate 
memory access activity-based power reduction. 

G. 2 shows one example of a power consumption reduction circuit 200 having a 
memory clocks5tifc^02 and a memory clock divider circuit 204 operatively coupled to 
a pluraUty of memory reallateh^cuits 206. The plurality of memory read latch circuits 
206 receives memory data 208 that isr&ad^m memory such as frame buffer memory 
(not shown). The memory read latch control cir&tiit204 dynamically activates and 
deactivates the plurality of memory read latches 208 through>^xontrol signal 2 1 0 based - 
on detected memory read requests 212. The detected memory read>eq^ts 212 may be 
25 any suitable memory access request from a suitable requestor, such as a gr^ic user 
interface engine, video overlay engine or any other suitable memory request engii 




The memory read latch control circuit 204 includes a read data latency 
compensation circuit 214 which determines how long it takes for memory to fetch data. 
30 This may be a preprogrammed register, for example. As known in the art, a memory read 
request 212 indicates typically how long to leave the read latches 206 on and also the 
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read latency period. The memory read latch control circuit 204 includes control logic 
216 that receives the memory read request 212 and obtains the read data latency 
information and generates a read latch control signal 218, which in this embodiment, 
indicates when to turn off the read latching flops during an active mode to reduce power 
5 consumption. The memory read latch control circuit 204 also includes an AND circuit 
220 responsive to the read latch control signal 218 and also responsive to a memory clock 
signal 222 generated by the memory clock source 202. The AND circuit 220 may be a 
hardware circuit, software or any suitable combination thereof. Likewise, the memory 
read latch control circuit 204 may be implemented using hardware, suitably programmed 
1 0 microcontrollers or other programmed logic circuitry, hardware state machine or any 
other suitable logic. The AND circuit 220 selectively enables and disables the memory 
read latches by generating the enable signal 210 as a function of the memory request 212. 
For example, if a memory read request 212 is detected, the read latch control signal 218 

'^Q is activated to allow the memory clock signal 222 to be passed through the AND gate 220 

fy 15 to provide the enable signal 21 0 to the plurality of read latching flops 206 for the time 
determined from the read data latency to allow the suitable data to be obtained from 
memory. It will be recognized that any suitable gating logic may also be used, such as a 
multiplexing circuit 230 where one input is tied to an idle value or any other suitable 

S logic. 

isKf 20 

As shown, the circuit 200 may also include a multiplexer 230 having an output 
232 which may be the memory clock signal 222 or any other signal input to the 
muUiplexer as selected by clock select signal 234. As such, in this embodiment, the 
multiplexer 230 has an input coupled to receive the memory clock signal 222, an input 

25 coupled to receive a memory clock feedback signal 236, a memory clock feedback signal 
with programmable delay 238 and an inverted memory clock signal with a programmable 
delay 240. The programmable delays 242a and 242b may be used, for example, to insure 
that timing is suitable for activation of the read latching flops based on layout delays 
caused by the location and length of signal lines on a printed circuit board or integrated 

30 circuit. 
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In operation, the memory clock feedback signal 250 obtained from an I/O buffer 
circuit 25 1 that receives a delayed memory clock signal from DLL 252. DLL 252 
attempts to mimic the delay of the memory clock to the memory. The multiplexer 230 is 
controlled to output a memory clock signal 232 or divided memory clock feedback signal 
5 236 or delayed memory clock feedback signal 238 or inverted delayed memory clock 
signal 240. In addition, it will be recognized that a DLL may not be used in which case 
delays can be compensated by any other suitable mechanism. 

The circuit dynamically turns on and off the read latching flops 206 based on the 
10 memory cycles. The plurality of flops are only enabled for read cycles and a few cycles 
of margin around the memory read cycles as dictated by the latency compensation 
duration. During other cycles, the read flops will not be active thereby reducing power - 
consumption and noise. This noise reduction can be useful with wider memory bus 
widths. Moreover, since data such as undesired data from read latching flops is not 
1 5 causing toggling during non-read conditions, additional power savings may be achieved. 

^'^■^^^^ K^G. 3 illustrates an example of a power consumption reduction circuit 300 
r employing aihemory clock divider circuit that is responsive to memory clock condition 
data 304, and an ^i|kie clock circuit 306 responsive to engine clock condition data 308. 

20 Examples of engine cloc^^smidition data include, for example, data representing whether 
I2C is enabled or whether a multimedia port is enabled. These may be user defined. 
Other examples include data represehtmg a host write buffer is not empty, whether video 
capture engine is enabled, whether an ov^ay scaler is enable, whether a GUI engine is 
active and any other suitable condition. As sh^wn in this embodiment, a memory 

25 controller 3 1 0 may include, if desired, the powerbtmsumption reduction circuit 200 to 
control the memory read latches as previously describedsHowever, it will be recognized 
that the circuit 302 and 306 may be used each alone or in cohiMnation with one another 
and with the power consumption reduction circuit 200. Also sho^s:^m this embodiment 
is a variable memory clock speed control circuit 312, such as that desc^la^ in co- 

30 pending patent application having attorney docket number 0 1 00 .0 1 25 3 , Serial^. 

09/130,746, filed on August 7, 1998, entitled "Dynamic Memory Clock Control S>^em 
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C^'^ an Method", by Lee et al., owi^d by instant Assignee and hereby incorporated by 

reference. However, it will be rtecognized that the variable memory clock circuit may be 
omitted or that any suitable variame memory clock speed control circuit may be used. 

5 The memory clock divider circuit 302 receives the memory clock signal 222 from 

a suitable memory clock source. The memory clock divider circuit 302 generates divided 
memory clock output signals 3 20a-320n. The divided memory clock output signals 
320a-320n are a plurality of independent clock signals in this embodiment, shown as 
multiple independent branches. A memory clock divider circuit 302 selectively activates 
1 0 at least some of the plurality of independent clock signals in response to received 
memory clock condition data 304. Examples of memory clock condition data may 
include activity-based, user defined condition data associated with differing data 
i=i processing units or request data. 

ry 1 5 In this embodiment, the example of the memory clock divider circuit 302 includes 

^ a plurality of AND circuits 322a-322n wherein each AND circuit outputs one of the 

!5 plurality of corresponding independent clock signals 320a-320n. Although not shown, it 

ii ~ will be recognized that suitable anti-glitching circuitry can be used to remove glitches 

when stopping and restarting the clock. Each AND circuit is coupled to receive different 
20 condition data 304 associated with different condition data sources, such as CRT 

controllers, frame buffer controllers, GUI engines, video scalers, video capture engines 
and any other suitable condition data source. 

By way of illustration, AND circuit 322a receives condition data CRT controller 
25 (CRTC) enable data, which may be, for example, a register bit indicating that a cathode 
ray tube has been enabled as the primary display device using the primary CRTC. The 
CRTC enable data is indicated as condition data 324a. Similarly, where a plurality of 
displays are connected for use, another AND circuit 322b may receive second CRTC 
enable data 324b indicating that a secondary display has been selected so that the 
30 graphics processor will output display data to two different display units using two 

different CRTC's (or any other suitable display controllers). Since display units may be 
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of different types, additional condition data may be logically coupled as being required 
before the memory clock to that engine is disabled during normal operation. As shown in 
FIG. 3, a half frame buffer signal 326 may serve as an input through an OR circuit 328 
whose output serves as condition data 324b. As such, it will be recognized that 
additional condition data may be logically configured to provide a suitable enabling or 
disabling of a memory clock to a selected processing engine as desired. 

ch a combination of condition data is shown for AND circuit 324c. In this 
example, the^i?qnhic user interface engine receives the independent clock signal 320c 
only after GUI activfes;ondition data 330 and GUI write data 332 are active. The GUI 
write condition data 332 mt^Qates that the GUI write buffer is not empty so that the 
system knows when the GUI engiitosisfmished writing to memory. Other condition data 
for the memory clock divider circuit 302^hGludes scaler enable data 334, subpicture "on" 
Q data 336 indicating that the subpicture operation Ras^tieen selected, video capture enable 

1 5 data 338, half frame buffer data 34 and scaler enable data542v.,.gOTidition data 338, 340 
and 342 are used for a video capture engine so that the memory clock tolhevs^eo capture 
engine will be disabled if none of the video capture related condition data is m« 
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O If desired, individual buffer circuits 342a-342n may be operatively coupled 

Q 20 between a requestor or engine and the memory clock divider circuit 302. The divided 
memory clock signal into independent clock signals or multiple independent branches, 
'0 allows the fine tuning of memory clock inputs to provide power reduction on an activity- 

based basis or a user-elected basis during normal operation. As such, if user-defined data 
is used, a graphic user interface may be supphed to a user to select which engines are to 
25 be disabled (by way of disabling the memory clock input) during normal operation. As 
such, a selection of the available engines may be provided visually and the user may then 
directly or indirectly select the corresponding engines to be disabled. This data may then 
be stored so that only certain AND gates 322a-322n are enabled (through enable lines not 
shown). It will be recognized that the memory clock divider circuit 302 may be 
30 implemented via software or a combination of hardware and software if desired. 
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Bngine clock circuit 306 includes a engine clock signal 360 coming from an 
/ engine clock source as known in the art. The engine clock signal is independently and 

selectively disabled for any suitable engine. As shown, a switching circuit 362 generates 
an outp It clock signal 364. Engine clock condition data is used to selectively couple the 
5 clock signal to at least one of a video overlay engine, a video capture engine, serial 

interface (e.g., I2C type) control logic and a multimedia port (MPP), or any other suitable 
processing engine such that the switching circuit 362 disables the output clock signal 364 
based c n condition data. By way of illustration, the switching circuit 362 may include an 
OR circuit 364, and AND circuit 366 and another AND circuit 368. The AND circuit 
366 receives the engine clock signal 360 and a suspend/standby conunand signal 370. 
The eni;ine clock signal 360 is then passed through the AND gate 366 when the suspend 
and sta idby mode is inactive, indicating normal operating mode. The OR gate 364 
serves to allow any of the condition data to output the engine clock signal 360 through 
AND sate 368. In this example, OR gate 364 receives I^C enable data, multimedia port 
enable data, host write buffer data indicating whether the write buffer is empty, video 
capture enable data, overlay enable data, GUI engine active enable data, register access 
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ndicating that registers are being programmed), or any other suitable condition 
As such, an I^C engine 374 or a multimedia port 376 may have the engine clock 
selectively enabled or disabled based on requisite condition data. This faciUtates a 
20 fine t ming capability from a power consumption perspective to allow both the engine 
clock and memory clock to be selectively disabled based on user selected conditions or 
activ: ty based conditions. For example, where the system is employed in a graphic 
accel :rator chip, the memory clock to the video overlay engine may be disabled by 
provi iing a scaler enable signal as user selected such that the memory clock signal 320d 
25 is no sent to the overlay scaler. In addition, an overlay enable signal may also be used as 
condition data to the engine clock control circuit 306 to prevent the associated registers 
for t le overlay engine to be disabled by effectively disabling the engine clock to the 
ovei lay engine registers. In this way, a host processor and/or graphics processing engines 
can] lot access the requisite registers associated with the video capture or video overlay 
30 engine. 
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In addition, an inverse discrete cosine transform (IDCT) AND gate 380 is 
provided to allow selective engine clock control for IDCT-related registers. This helps 
facilitate power reduction when video decode processing operations are not being 
performed. 

It should be understood that the implementation of other variations and 
modifications of the invention in its various aspects will be apparent to those of ordinary 
skill in the art, and that the invention is not limited by the specific embodiments 
described. For example, although AND gates and OR gates are shown, any suitable 
gating circuits or logic may be used. It is therefore contemplated to cover by the present 
invention, any and all modifications, variations, or equivalents that fall within the spirit 
and scope of the basic underlying principles disclosed and claimed herein. 
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